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Abstract 

We study the optimal linear /^-approximation by operators of finite rank (i.e., 
approximation numbers) for the isotropic periodic Sobolev space H s (T d ) of frac¬ 
tional smoothness on the d-torus. For a family of weighted norms, which penalize 
Fourier coefficients f{k) by a weight w S}P (k ) = (1 + ||&||p) s,/p , 0 < p < oo, we prove 
that the n-th approximation number of the embedding Id : H s (T d ) —> L 2 (T d ) is 
characterized by the n-th (non-dyadic) entropy number of the embedding id : £ d —> 
raised to the power s. From the known behavior of these entropy numbers we 
gain a complete understanding of the approximation numbers in terms of n and d 
for all n € IN and d G IN. 


1 Introduction 

Approximation numbers, also known as linear n-width, are one of the fundamental 
concepts in approximation theory. In Hilbert space settings, they describe the worst-case 
error that occurs when we approximate a class of functions by projecting them onto the 
optimal finite-dimensional subspace. Hence, approximation numbers are also of interest 
in the numerical analysis of partial differential equations (PDE) as they provide reliable 
a-priori error estimates for certain Galerkin methods. In this context, approximation 
numbers related to isotropic Sobolev spaces, Sobolev spaces of mixed regularity, and 
spaces of Gevrey type appear naturally. Subject of this paper are preasymptotic bounds 
for these approximation numbers, which substantially improve the known error bounds 
in high-dimensional settings. 

Bounds, which describe the decay in the rank n G IN of the optimal projection 
operator, have been known for decades for the aforementioned approximation numbers. 
In high-dimensional settings, where the functions to be approximated depend on a large 
number of variables d G IN, these classical bounds become problematic. They only 
inadequately capture the effects of the approximation problem’s dimensionality d. We 
state this issue precisely in the subsequent Sections 11.2111.31 and 11.41 For the moment, 
we only stress that this has consequences in at least two respects when the dimension 
d becomes large. For one, the classical bounds are trivial until n is exponentially large, 
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say n > 2 d . This severely limits their applicability. Moreover, from the viewpoint 
of information-based complexity, it is impossible to determine the tractability of the 
approximation problem rigorously. To address any of the two issues, a first step is 
to uncover how the equivalence constants in the classical error bounds depend on the 
dimension d. As it turns out this is often not enough. In fact, one has to determine 
explicitly how the approximation numbers behave preasymptotically, that is, for small 
n < 2 d . This typically involves to find good estimates for complicated combinatorial 
problems that evolve from the structure of the smoothness spaces. As we will see, the 
preasymptotic behavior can be completely different from the asymptotic behavior. 

1.1 An abstract characterization result: counting via entropy 

The essence of this paper is that for certain relevant approximation numbers of peri¬ 
odic isotropic Sobolev spaces and periodic spaces of Gevrey type, it is not necessary to 
compute preasymptotics by hand. Instead we exploit that the approximation numbers 
can be determined by covering certain f A unit balls with f^-balls, a problem which is 
already well understood. In fact, this turns out to be just a special case of a general 
characterization result. This holds true for Sobolev type spaces H w (T d ) defined on the 
d-torus T d = [0, 27r] d , where the smoothness weights w = (w(k)) keZ d take a special form. 
The spaces H w (T d ) are given by 

= y g i 2 ( T 4 ) : w(k) 2 \c t (!) I 2 < 00} 

kez d 

where Ck{f) denotes the kth Fourier coefficient. The weights in the sequence w are of 
the form 


w(k) = V (\\k\\), k€ Z-, ( 1 ) 

where || - || is a (quasi-)norm on R d and (p a univariate, monotonically increasing func¬ 
tion (p with y(0) = 1. The characterization results now states that the approximation 
numbers associated to the embedding Id : H w (T cl ) —y L 2 ( T d ) are bounded for all n 6 IN 
from above and below as 

l/<p{2/e n ) < a n (ld : H w (T d ) -> L 2 (T d )) < l/y(l/(4e n )). (2) 

Here, e n = £ n (id : £ d ,^ —> are entropy numbers. They give the smallest e > 0 such 
that the fy-unit ball can be covered by n f^-balls of radius e. For the precise statement, 
see Theorem 14.31 

1.2 Preasymptotics for isotropic Sobolev spaces 

The first concrete application of the abstract result (J2D yields results for the isotropic 
Sobolev space H s (T d ) with fractional smoothness s > 0. Isotropic Sobolev regularity 
is the natural notion of regularity for solutions of general elliptic PDEs, typically the 
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solution will be contained in H l { T d ) or H 2 (T d ). The space H s (T d ) can be defined as 
H s (T d ) = H Ws ' 2 (T d ), where the weight sequence w Si2 is given by w Sj2 (k) = (1 + ||fc|||) S//2 . 
Note that the norm ||/|7P(T a! )|| = w s, 2 (k) 2 \ck{f)\ 2 is natural in the sense that 

if s G IN, then this norm is equivalent up to a constant in s to the classical norm, which 
is defined in terms of the L 2 -norms of the derivatives up to order s. 

Concerning the approximation numbers a n (Id : H s (T d ) —> L 2 (T d )), the exact asymp¬ 
totic decay in n has been known for decades. In 1967, J. W. Jerome ra proved that 

c M n~ s / d < a n (Id : H s (T d ) -G L 2 (T d )) < C s , d n~^ d , (3) 


with constants c S: d and C S: d that were merely known to depend on the fractional smooth¬ 
ness s and the dimension d. For further references and historical remarks in this direc¬ 
tion, see the monographs by Temlyakov [25 J and Tikhomirov [26]. 

In order to obtain preasymptotics for (J3) and clarify the d-dependence of the con¬ 
stants, we not only consider the weights w Sj2 but the family of weights w SyP given by 

w S}P {k)= (1 + Sj=i \kj\ p ) s,p if 0 < p < oo, and ^ 

w a ,p(k) = max(l, |fci|,..., \k d \) s if p = oo. 


For 0 < p < 1, the weights w sp can be interpreted as imposing a compressibility con¬ 
straint on the Fourier frequency vectors; the less k G is aligned with one of the 
coordinate axes, the stronger the penalty through a large weight w StP {k). The function 
spaces H s,p (T d ) := H Ws ’ p (T d ) coincide as sets with the classical isotropic Sobolev space. 
Applying the abstract result ([2])! we immediately obtain Theorem 11.11 


Theorem 1.1. For 0 < p < oo and s > 0, we have 


a n {ld : H s ' p {T d ) -> L 2 ( T d )) 


'S,p 


Iog(l+d/ log n) 
log n 


s/p 


d -s/p n -s/d 


: 1 < n < d 
: d < n < 2 d 
: n > 2 d . 


It is clearly visible how a smaller compressibility parameter p makes the approximation 
problem easier by amplifying the preasymptotic logarithmic decay in n. The equivalence 
constants in Theorem 11.11 depend only on s and p and can be completely controlled. In 
particular, we have the limit result 


lim n s/d a n (ld : H s ’ p (T d ) L 2 (T d )) = (vol (B d )) s/d x d~ s/p , 

n—>oo 1 


see Corollary 15.21 A consequence of Theorem 1 1.1 1 is that we face the curse of dimension¬ 
ality in the strict sense of information-based complexity if and only if p — oo. Otherwise, 
the approximation problem is weakly tractable , despite the slow asymptotic decay n~ s ^ d . 
For details, see Section [7] 
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1.3 Spaces of Gevrey type and a connection to hyperbolic cross 
spaces 

The classes of smooth functions that are nowadays called Gevrey classes were already 
introduced in 1918 by M. Gevrey [6], they occurred in a natural way in his research on 
partial differential equations. Since then they have played an important role in numerous 
applications, in particular in connection with Cauchy problems. The recent paper pH] 
introduces the periodic spaces of Gevrey type G a ^ ,p {T d ), 0 < a,f3,p < oo, which consist 
of all / G C'°°(T d ) such that the norm 

||/|G“A»(T')|| := (y exp^llfcipM/JlA 

\kez d ) 

is finite. Here Ck(f) denotes the Fourier coefficient with respect to the frequency vector 
k = (ki,...,kd) G h d , defined in Section [2] below. For 0 < a < 1, the spaces G a, P' p (T d ) 
coincide with the classical Gevrey classes and contain non-analytic functions, while for 
a > 1 all functions in G a,p,p (T d ) are analytic. Some more background on Gevrey classes 
and references can be found in Section [6j 

In Theorem 16.11 we prove lower and upper bounds for the approximation numbers 
a n of the embedding Id : G a,/3,p ( T d ) —» L 2 (T d ) for all n G IN and arbitrary parameter 
values a, j3 > 0, 0 < p < oo. Due to our proof technique we can determine the rate of 
convergence only up to a constant. However, for 0 < p < oo and a < min{l,p}, we at 
least obtain an indication for the correct asymptotic behavior by the limit statement 

lim a n ■ exp(A f5n a ^ d ) = 1, 

n—>oo 

where A := vol (B d )~ a ^ d , see Theorem 16.21 

What concerns preasymptotics the bounds turn out to be rather surprising in the 
particular situation a = p. For 1 < n < 2 d , we obtain the two-sided estimate 

n i°s(i+d/< a n (Id : G p,/3,p (T d ) — y L 2 (T d )) < n 1 °s( 1 + d / lo s(")) < n I °ii T + 3 I. (5) 


This estimate is almost identical to the preasymptotic estimate which has been obtained 
in the recent paper [TB] (see also ((TJ) below) for approximation numbers of the embeddings 


Id : H r mix (T d ) L 2 (T d ), 


where Hf aix (T d ) is the Sobolev space of dominating mixed smoothness equipped with 
the norm 


l^; i x(T‘')ll := 


£w/)i 2 n( i+ fe 

ke1* d 3= 1 


1/2 


It is rather counterintuitive that the approximation numbers behave almost identically 
in the preasymptotic range. After all, the spaces of Gevrey type G p,p,p (T d ) contain 
substantially smoother functions than the space -Hm ix (T d ). We discuss this in more 
detail in Section 16.11 and give at least partial explanations for this odd phenomenon. 
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1.4 Preasymptotics for embeddings into H s 

Yserentant ra proved that eigenfunctions of the positive spectrum of the electronic 
Schrodinger operator possess a dominating mixed regularity. To solve the electronic 
Schrodinger equation numerically, Galerkin methods combined with sparse grid tech¬ 
niques mu are widely used. The discussion in Subsection 11.51 below shows that one is 
particularly interested in measuring the error in the energy space H 1 . From results in 
[ 8 ] it follows that 

CdrT (r-1) < a n (H r mix (T d ) -> H\T d )) < C d n~^- l \ ( 6 ) 


with constants Cd, Cd depending on the dimension d. In [3] [ 8 ] it has been observed 
that Cd = d 2 0.97515 d . This result suggests that the truncation problem even gets easier 
with a growing number of electrons. However, as [3] shows, the constant Cd can only be 
chosen as above for exponentially large n > (1 + n f) d . This raises the question how the 
approximation numbers in (( 6 ]) behave preasymptotically. 

Unfortunately, the abstract result (J2]) cannot be applied to obtain preasymptotics 
for dHD, since the space H^ ix (T d ) cannot be written as a space H w (T d ) with a weight 
sequence w of the form (jTO . However, the observations described in the previous subsec¬ 
tion and results in [IB] give an indication for the preasymptotic behavior. The results 
in m Thm. 4.9, 4.10, 4.17] provide the two-sided estimate 


\ \ 2+log(l/2+d/ log(n)) 

2 n) 


4+2 log2 d 


— °n(Id : HC x (T d ) —)■ L 2 (T d )) < 


n 


(7) 


in the preasymptotic range 1 < n < 4 d . With the coincidence 

«n(Id ; H r mix (T d ) -)> H^ ix (T d )) = a n (Id : T d ) ^ L 2 (T d )) : 

provided r > s > 0, we obtain from ((7|) by embedding 


a n (Id : H r mix (T d ) H s (T d )) < 



( 8 ) 


The connection between spaces of Gevrey type and spaces of dominating mixed smooth¬ 
ness sketched in Subsection Ol (see also Subsection 16.II below), might be useful to refine 
the result (JSJ). Indeed, for spaces of Gevrey type we obtain the following result as a 
consequence of our abstract technique. For 1 < n < 2 d , we have 

c~\ {3 co r 

Cl A(n,d) s/ 2 n“T^T < a „(Id : G 2 ’ r/ 2 ’ 2 ( T d ) H s (T d )) < c 2 A(n, d) s/2 n ~^, (9) 

where A (n,d) = and 7 (n,d) = log(l + rf/log(n)); compare with ([3]). 
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1.5 Approximation numbers and Galerkin methods 

To conclude this introduction, let us outline the connection between approximation 
numbers and reliable a-priori error estimates for Galerkin methods. Consider a general 
elliptic variational problem in H s = H s (T d ), which is given by a bilinear symmetric 
form a : H s x H s —> R and a right-hand side / 6 H~ s . The bilinear symmetric form is 
assumed to satisfy, for any u,v G H s , 

a{u , v) < /ii|| u |il s || || v |i/ s || and a(u , u ) > /Lt 2 1| u |H S || 2 . 

Under this assumption, a(-, •) generates the so called energy norm equivalent to the norm 
of II s . The problem now is to find an element u G II s such that 

a(u,v ) = (/, v) for all v G H s . (10) 

In order to get an approximate numerical solution Galerkin methods solve the same 
problem on a finite dimensional subspace 14 in H s , 

a(u h ,v ) = (/, v) for all v G V h . (11) 

By the Lax-Milgram theorem m the problems (TTU |1 and (1TTT) have unique solutions u* 
and u* h , respectively, which by Cea’s lemma [I], satisfy the inequality 

\\u* — u* h \H s \\ <(^ 1 /^ 2 ) inf ||w* — v |^ s ||. (12) 

>■( \'h 

The naturally arising question is how to choose the optimal n-dimensional subspace 14 
and linear finite element approximation algorithms such that the right-hand side in (TT 2 l) 
becomes as smalls as possible. Under the assumption that the solution u* is contained 
in the unit ball of some smoothness space U C H s , the minimal right-hand side in (fT21) 
is bounded from above by the approximation number a n (Id : U —> H s ). Summarizing, 

\\u*-u* h \H s \\ < (/i 1 /p 2 )a n (Id : U H s ) 

gives a worst-case a-priori error estimate for the optimal n-dimensional subspace 14 • 

2 Preliminaries 

Notation As usual, the set IN denotes the natural numbers, Z the integers and R 
the real numbers. By T we denote the torus represented by the interval [0, 27r] where 
opposite points are identified. A function / : T d —» C is 27r-periodic in every component. 
If / G L 2 ( T d ) then the Fourier coefficient Ck{f) with respect to the frequency vector 
k = (hi ,..., k d ) G is given by 

c k (f) = ( 2 vr )- d / 2 [ f(x)e~ ik - x dx. 

Jr d 
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For a real number a we put a + := max{a, 0}. The symbol d is always reserved for 
the dimension in R d , !N d , and T d . For 0 < p < oo and x G R we denote ||a;|| p = 
iyty \x t \ p ) 1/p with the usual modification in the case p = oo. We write £ d for R d 
equipped with the norm || • || p . By B d we denote the closed unit ball of £ p . When we 
write log, we always mean the logarithm to base 2. If A" and Y are two Banach spaces, 
the norm of an element x in X will be denoted by ||a:|A|| and the norm of an operator 
A : X —* Y is denoted by ||A : X —> Y\\. The symbol X <—)• Y indicates that the 
embedding operator is continuous. 

Approximation numbers Let A", Y be two (quasi-)Banach spaces 
imation number of an operator T : X —> Y is defined by 

a n {T : X Y) inf sup || Tf - Af\Y\\ 

rankA<n \\f\ X \\<l 

= inf \\T-A:X-t Y\\. 

rank A<n 

Covering and entropy numbers Let A C R d . An e-net for A is a discrete set of 
points Xi,... ,x n in R^ such that 

n 

Ac\J(x l+£ Bi). 

i —1 

The covering number N e (A) is the minimal natural number n such that there is an e-net 
for A. Inverse to the covering numbers N e (A ) are the (non-dyadic) entropy numbers 

e n (A,0 := inf{e > 0 : N e (A) < n}. 

If A = Bj| || = {x 6 R d : ||x|| < 1 } is a unit ball, we also use the notation 

e n (id: 4,,—>0 :=e n (Rf||, 0 - 

In the applications which we have in mind || • || will be a classical (quasi-)norm || • || = || • || p 
for 0 < p < oo. In this case, the behavior in n and d of the entropy numbers e n (id : 
t d is completely understood For the reader’s convenience, we 

restate the results. 

Proposition 2.1. For all n G IN. we have 

n~ l/d < e n {id : l\ d x -)• £ d J < 2 n~ l/d . 

However, e n {id '. = 1 as long as n < 2 d . 

Proposition 2.2. Let 0 < p < oo. Then, 

{ 1 : 1 < n < d, 

( l ° i(i rA'° i " > ) 1/ " : d < n < 2 “’ 

d~i/P n -i/d . n >2 d , 

with constants independent of n and d. 


The n-th approx- 


(13) 
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The equivalence constants in Proposition 12.21 are not further specified in the literature. 
It is possible to calculate explicit, but rather lengthy expressions. We refrain from going 
more into detail at this point. In Section [3j we will comment on the behavior of the 
constants for n — > oo. 

Remark 2.3. The closely related entropy numbers £ n (Sp _1 , where S^ -1 = {x G 
R' d : ||x|| p = 1}, have been understood only lately [10, [18]. It is no surprise that these 
behave identically to the entropy numbers e n {id : i d —* i^f), except that asymptotically 
they decay as To prove the bounds on £„.(Sone largely mimics the 

well-known proof for £ n (id : i d —* i^f). Surprisingly, there is one case where this strategy 
fails. For 0 < p < 1 and n > 2 d the familiar volume arguments become inaccurate and 
it needs different techniques to obtain matching bounds, see m- 


Notions of tractability In the course of this paper we want to classify how the 
dimension d affects the hardness of the approximation problem Id : H v '(T d ) —>- L 2 (T d ) 
depending on the weight sequence w. The field of information-based complexity provides 
notions of tractability [Tj5], which rate the difficulty of the approximation problem in 
terms of how its information complexity 

n(e,d) := mf{n G IN : a n (Id : R w (T rf ) —>■ L 2 (T d )) < e} 

grows in l/£ and d. Let us first note that for all weight sequences w considered in this 
paper we have an initial error 

ai(Id : H w (T d ) —>■ L 2 (T d )) = ||Id : H w (T d ) ->• L 2 ( T d )\\ = 1. 

Hence, the normalized (relative) error and the absolute error coincide. Now, the approx¬ 
imation problem is said to be polynomially tractable if n(e, d) is bounded polynomially 
in e^ 1 and d, i.e., there exist numbers C,r,q> 0 such that 

n(e, d) < C e~ r d q for all 0 < £ < 1 and all d e IN. 

The approximation problem is called quasi-polynomially tractable if there exist two con¬ 
stants C, t > 0 such that 


n(e, d) < C exp(f(l + ln(l/£))(l + lnd)). 


It is called weakly tractable if 


lim 

1 / £-\-d-^OC) 


logn(£, d) 
\/e -\- d 


0, 


(14) 


i.e., the information complexity n(e,d) neither depends exponentially on l/£ nor on d. 
We say that the approximation problem is intractable, if (1141) does not hold. If for some 
fixed 0 < £ < 1 the information complexity n(e, d) is an exponential function in d then 
we say that the problem suffers from the curse of dimensionality. To make it precise, 
we face the curse if there exist positive numbers c, £q,7 such that 


n(e, d) > c(l + y) d , for all 0 < £ < £q and infinitely many d e IN. 




3 Counting via entropy 

The grid number G{A ) of a set A C R d is the number of points in A that lie on the grid 
Z d . Formally, 

G(A) =${Anz d ). 

The grid numbers G(rB d A, r G R, are central in the study of approximation numbers 
a n (Id : H w {T d ) — * L 2 (T d )) if the weights w are induced by some (quasi-)norm || • ||, see 
Section H] below. In this section, we show that the combinatorics for grid numbers can be 
reduced to covering arguments, at least if the studied set is solid. We call a set A C R d 
solid if for all x G A every vector y G R d which component-wise fulfills |p,| < |xj| is 
contained in A. For instance, the unit ball B d is solid for any 0 < p < oo. 

Lemma 3.1. For a solid set A C R d we have 


Ni(A) < G(A) < N P (A) 

for any p < 1/2. 

Proof. For x G A, we define component-wise by |ycj j '■= sign x 3 LkilJ- Clearly, 
|| |ycj — x||oo < 1 for any x G A. Since the set A is solid, x € A implies £ A. Hence, 
the intersection Ap\Z d forms a 1-net of A in Consequently, we have N { (A) < G(A). 
The upper bound is a direct consequence of the fact that it needs at least G(A) many 
balls of radius p < 1/2 to cover A D Z d . □ 

A function || • || : R d —> [0, cxd) is called a p-norm for some 0 < p < 1 if || • || fulfills 

the norm axioms of absolute homogeneity and point separation and, furthermore, the 
p-triangle inequality 

\\x + y\\ p < \\x\\ p + \\y\\ p 

holds true for any x,y G R d . The typical example for a p-norm with 0 < p < 1 is 

II ' II = II ' Up- If A C R d is the unit ball of a p-norm || • ||, there is another relation 

between covering and grid numbers. In this relation the quantity 

d 

A ll-ll( ci ) := lE e *|| 

i= 1 

appears, where e 1} ... , denote the canonical basis vectors in R d . Note that if || • || = || • || p 
for 0 < p < oo we have A||.||(d) = d lA . 

Lemma 3.2. Let || • || be a p-norm in R 0 ' for some 0 < p < 1. For r > A||.|i(d)/2, put 

l(r,P,d):= (r-- \ H (d)”/2”) ,/F , 

L(r,p,d) := (r p + \ H (df/V) 1/r . 

With By denoting the unit ball, we have the relation 

N 1/2 (l(r,p,d)B( ll ) < G(rBf H ) < N 1/2 (L(r,p,d)Bf H ). 
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Proof. Let Q k = k + [—1/2, l/2] d . The p-triangle inequality for || • || yields 

l( r ,P,d) B l || C 1J Q k CL(r,p,d)Bf^. 

kez d , 

\\k\\<r 

The left-hand side inclusion shows that the set rBy D Z cZ is a 1/2-net of l(r,p,d)B y 
in This shows the left-hand side inequality of the statement. The second inequality 
follows from the right-hand side inclusion by a simple volume argument. □ 

Lemma 13.21 yields the following bounds for entropy numbers. Note that the upper 
bound is a refinement for large n of the usual upper bound found in the literature, 
compare also with Propositions 12 .1112.21 

Lemma 3.3. Let || ■ || be a p-norm for some 0 < p < 1. For n > (d 1 ^ p /2) d vol(By), we 
have 

i(n/ vol(By )) 1//d < e n (id : -> £ d J < i((n/vol {Bf H )) p/d - 2 1 ~ p d 1 / p )y 1/p . 

Proof. The lower bound is the standard lower bound, which follows from simple volume 
arguments and in fact holds true for all n G IN. To see the upper bound, choose 
r = ((n/ vol(B|.||)) p / d — d p / p /2 p )) l ^ P . Then it follows from the right-hand side inclusion 
of (Il5]i that G{rB d ) < n and further from the left-hand side inclusion of (1151) that 
£ n (id : £ d —> < 1/(2 l(r,p,d)), where l(r,p,d ) is defined in Lemma HOI It remains to 

plug in the formula for r. □ 

Let us briefly come back to the discussion on the equivalence constants in Proposition 
12. H and Proposition 12.21 An interesting question is whether the equivalence constants in 
the lower and upper bounds necessarily have to be different or whether this is just an 
artifact of the used proof techniques. Lemma 13.31 allows a partial answer. In the limit 
n —y oo we have 

lim n 1/d e n (id : £% -> O = 1/2 • vol (BL) 1/d . (16) 

n—»oo 11 11 11 11 

If || • || = || • ||p for 0 < p < oo, then vol(B||.||) 1/,d = vo \(B d ) l ^ d x d -1 / p , see [ 28] . 

4 Characterization of approximation numbers 

In this section, we prove a number of characterization results for approximation num¬ 
bers a n (Id : H w (T d ) —> L 2 (T d )) when the weight sequence w is derived from some 
(quasi-)norm in R d . To begin with, let us recapitulate some well-known facts about 
approximation numbers of weighted spaces. Let w = {w{k)) k& ^d be an arbitrary weight 
sequence such that 1/w := (1 /w(k)) ke %d G £oo(fL d ). It is well-known that the approxi¬ 
mation numbers are given by the non-increasing rearrangement (cr n ) nGlN of the inverse 
weight sequence 1/w, that is, 

On (Id : H w (T d ) —>■ L 2 (T d )) = cr n (17) 
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for all n G IN. We briefly sketch the proof of this fact, details and further references can 
be found in m Section 2.2], Consider the isometries 

A” : H w (T d ) h(Z d ), f (w(k)c h (f)) te „ (18) 

and 

F : £ 2 (Z d ) -G L 2 (T l ), {i k ) k&d ^ (2vr )- d / 2 £ k e ikx , (19) 

kez d 


as well as the diagonal operator 

D w : £ 2 {fL d ) —> £ 2 CZ d ), ^ (£k/u’(k)) k£Z d. ( 20 ) 

Obviously, we have Id = Fo R) w o A w , which is illustrated by the commutative diagram 
below. 


tf w (T^ 




Id 


L 2 (T a 


F 


,(Z a 


D v 


>(Z Q 


Figure 1: Commutative diagram for the embedding Id : i/ w (T d ) —> L 2 (T d ) 


It is known that the approximation numbers of the diagonal operator are given by 
(^n)neiN, and from ||v4 w || = ||F|| = 1 we conclude 

On (Id : H w (T d ) -G L 2 (T 1 )) = a n (D w : £ 2 { Z d ) -G £ 2 (Z d )) = a n . 

We come to our first characterization result. For weight sequences w given by a 
(quasi-)norm || • || on R d , we show that the non-increasing rearrangement (cr n ) ng ]N is in 
fact equivalent up to constants to the entropy numbers e n (id : —» £^). 

Theorem 4.1. Let || • || be some (quasi-)norm on R d such that min^^...^ ||e*|| = 1, 
where e \,..., denotes the canonical basis in R d . Consider the weight sequence w = 
(w(k)) keZ d given by w(k ) := max{l, ||/e||}. For every n 6 IN, we have 

1/2 £ n (id : 4,| -G O < a n (Id : H w (T d ) -G L 2 (T d )) < 4e n (id : -G £ d J. 

Proof. Let (cr n )neiN denote the non-increasing rearrangement of (1 /w(k)) keZ d. By (fTTl) 
we know that a n (Id : H v '( T d ) —» L 2 (T d )) = a n . Since G(mB d ^) = ${k G Z rf : ||/c|| < 
m} = §{k G 1j d : w(k) < m} and w{mei*) = m, where i* = argmin ,; =1 ^ ||e,||, we have 

cr G(mB d l] ) ~ l/ m - 
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Let us first prove the upper bound. For brevity, we write e n = £ n (id : in 

the following. For given n G IN, let £ > £ n and put m := |_l/((2 + J)£)J for some 5 > 0. 
By virtue of Lemma 13.11 we obtain 

n > N £ (B f,,) = NmimBfa) > N 1/{2+5) (mB f,,) > G{mBf H ). 

The monotonicity of approximation numbers yields 


»V. < (mflf ,) = !/ m < 2 ( 2 + <5)e- 


( 21 ) 


Since 5 can be chosen arbitrarily close to 0 and £ arbitrarily close to e n , we reach at 
cr n ^ 4 £ n . 

To prove the lower bound, assume £ < £ n for some n G IN and put m — [T/e]. We 
have n < 7V £ (By) < NiimB^^) < G(mB y), where the last estimate is due to Lemma 
l3Tl Thus, cr n > cr G ( mS d ) = 1/m > 1 /2er. Again, as £ may be chosen arbitrarily close to 
£ n , we have a n > l/2e n . □ 


Remark 4.2. (i) The constants 1/2 and 4 are an artifact of the proof technique. We 
do not claim that these are optimal. 

(ii) The assumption min* = x,...,rf ||e*|| = 1 in Theorem 14.11 has only been made to keep 
the formulation of the statement and the proof as simple as possible. In particular, the 
initial error is always 1. If miiij =1 ^||o|| = c / 1, then the statement still holds true, 
provided we define w(k) := max{c, ||/c||}. Otherwise, i.e. when keeping the definition 
w(k) : = max{l, ||fc||}, the statement holds true for sufficiently large n > no(c,d), where 
no (c, d) can depend on c and the dimension d. 


The statement of Theorem 14.11 can be easily generalized. 


Theorem 4.3. Let ||-|| be some (quasi-)norm on R d as in Theorem \4-l\ and let ip : R —>■ R 
be a monotonically increasing function satisfying 93 ( 0 ) = 1. Consider the weight sequence 


w = <£>(|| • II) given h V w (k) '■= fdlkW)- 

n G IN \ {1}, the estimate 


Writing £ n = £ n (id : £f ,,, —> £%f), we have, for all 


¥>(2/£n) 


< a„(Id : | l-ll)(T d ) ->■ L 2 (T d )) < 


l/(4e„))‘ 


Proof. Let w be the weight sequence given by w(k) = max{l, ||fc||} for k G Z d . Further, 
let (cr n ) ne ]N be the non-increasing rearrangement of 1/w. Note that <£>(||fc||) = ip(w(k)) 
for k 7 ^ 0 since min i=1)d 11 11 = 1. Put 71 = 1 and 


1 


( 22 ) 


for natural n > 1. Since ip is monotonically increasing the sequence ( 7 n )neiN is non¬ 
increasing and thus the non-increasing rearrangement of (1 /w(k))k e z d - It remains to 
combine (l22]l with the finding of Theorem 14.11 □ 
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The constants in the lower and upper bound of Theorem 14.11 do not match. A 
consequence of Theorem 14.41 below is the limit result 

, a n (Id : -> L 2 (T d )) 

lim ---j-—-= 2, 

n^oo £n(ld : 

which suggests that the true constant in the lower and upper bound should be 2 for 
sufficiently large n. 

Theorem 4.4. Let || • || be a p-norm in B d for 0 < p < 1. Further, let (p be given by 
p(t) = exp ((3g(t)) with (3 > 0 and monotonically increasing, differentiable g satisfying 
g( 0) = 0 and lim= 0. Recall the weight sequence w = <^(|| • ||) defined in 
Theorem \f.3\ Using the shorthands a n = a n (Id : iT^(IMI)(T d ) —> L 2 (T d )) and e n = £ n (id : 
,, —> Fff), it holds true that 


lim a n ip{l/ (2e n )) = 1. 

n—>• oo 


Proof. Let (cr n ) ne N be the non-increasing rearrangement of (l/max{l, ||fc||}) fcgZ d and 
(7n)nelN be the non-increasing rearrangement of (l/<^(||/c||)) fcgZ d. Let us first refine the 
upper bound (12Tj) . Consider n e IN sufficiently large such that e n <1/2. With 5 > 0 
arbitrary, £ > 0 such that e n < e < 1/(2 + 6), and m := (_l/((2 + h)£)J, we obtain 

„ . (2 + 5)e 

(?n < °G(. mB d) - 1/m < x _ ( 2 + ( 5 ) e - 

Since we may choose 6 arbitrarily close to 0 and £ arbitrarily close to e n , we obtain 
o n < l/h 1 (l/(2e n )), where h\{t) — t — 1. 

To obtain a refinement of the lower bound, choose for e < e n the natural number 
m = f 1/(2£)~|. Using Lemma 13.21 we obtain n < N e (B d y y) < Ni/ 2 (mB d y^) < G(mB d .y), 
where 

m := (m p +A| M |(d) p /2 p ) 1/p . 

Hence, 

On > 1/m > l/h 2 (l/(2£)), 

where h 2 {t) = ((t + l) p + (A||.||((i) p /2 p ) 1 / p . Since we may choose £ arbitrarily close to e n , 
we obtain l//r 2 (l/(2£ n )) < a n . 

Combining the refined estimates with equation fl22lh we obtain from multiplying by 
p(l/(2e n )) the two-sided estimate 


^(l/(2en)) 

p(h 2 ( l/(2£ n ))) 


< 7n^(l/(2£n)) < 


^(l/(2gn)) 

^l(l/(2£ n )))- 


(23) 


We have In )} ) = P{g(x) - g(x - 1)) = /3g'{£ x ) for some x - 1 < < x. From 

the assumptions on g, it obviously follows that linx^oo g'(£ x ) = 0. Hence, we have 
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lim, woo 7 n 9?(l/(2£ n )) < 1. For the estimate from below we have to show that g(x) — 
g(h 2 (x)) —> 0 for x —> oo. By the mean value theorem, it follows that 

\x - h 2 (x )| < 1 + \x + 1 - h 2 (x )| 

for some /i G [0, A||.||(d) p /2 p ]. Due to p < 1 we further may estimate |x —/^(a?)! < C d x l ~ p 
for some Cd > 0. Combined with another application of the mean value theorem, this 
yields 


I g{x) - g(h 2 (x))\ < |^(0II® - h 2 {x )| < C d \g\0 lx 1 p < C d \g'(^)\^ p , 

where f G [x, h 2 (x)\. Since we have assumed lim^oo g\x)x l ~ p = 0 it follows that 
1 ^ linijj^oQ y n <^9(l/(2£ n )). ID 

5 Isotropic Sobolev spaces 

In this section, we give further details and additional remarks to the results presented 
in Subsection 11.21 of the introduction. 

Proof of Theorem \l.l\ For 0 < p < oo, Theorem 14.31 with || • || = || • || p and <p(t) = 
(1 + t p ) s / p yields 

2 -( 1 + S /D £n (id : £d ^ £djs < an(Id . H s*( T d) ^ < 4 s £ n (id : £ d ->• O s . 

It remains to apply Proposition 12.21 In case p = oo, the argumentation is analogous 
with ip(t) = max{l, t} s . □ 

In the special case p = oo, let us restate Theorem 11.11 with explicit expressions for the 
equivalence constants. 

Theorem 5.1. For p = oo and s > 0, we have 

2 -{i+i/p)s n -s/d < a ^ Id . H s, oo( T dj ^ L 2 (T d )) < 8 s n~ s/d . 

for all n G IN. However, a n (Id : H s, °°(T d ) —> L 2 ( T d )) = 1 as long as n < 2 d . 

Proof. Combine Theorem 14.31 with ip(t) = max{l,t} s and Proposition 12.11 □ 

Theorem 14.41 applied to the approximation numbers a n (Id : H s,p (T d ) —> L 2 (T d )) yields 
the following corollary. 

Corollary 5.2. Let 0 < p < oo and s > 0. Then 

lim n s/d a n (Id : H s ’ p (T d ) —>■ L 2 (T d )) = (vol (B d )) s/d x d~ s/p . 

n—> oo ^ 
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Remark 5.3. This work continues the considerations made in [15j. Theorems rmiTTi 
and 15.II extend [ 151 Thm. 4.3, 4.11, and 4.14], which covered only the cases p — 1, p — 2, 
and p = 2s. Moreover, by Theorem 11.11 we close the logarithmic gap in [ I5l Thm. 4.6] 
and confirm [T5J Rem. 4.7]. 

Remark 5.4 (Approximation in L^). In our results the approximation error is measured 
in L 2 (T d ). It would be highly interesting to have analogs to Theorem ll.il Theorem 14.11 
and Theorem 15.11 for the approximation in L oc (T d ). The recent work [2] provides the 
formula 

(Id : H s ’ p (T d ) -G L q o(T d )) = (^a 2 ) 1 / 2 , 

j>n 

where (cr n ) nS ]N is again the non-increasing rearrangement of the inverse weight sequence 
(1 /w S:P (k)) ke %d. In principle, this allows to prove an analog to Theorem 14.11 However, 
the constants in the known bounds for the entropy numbers of the embbeding id : 0. d —> 
are not good enough to obtain meaningful preasymptotics. 

6 Spaces of Gevrey type 

In this section, we study approximation numbers of spaces G a ^ ,p (T d ) = H w ^^’p(T d ) 
with exponential weights given by 

w a,pA k ) = exp(^||fc||“), k G Z d . (24) 

As already indicated in the introduction, the study of spaces G a, ^ ,p ( T d ) is motivated by 
classical Gevrey classes. Let us elaborate a bit more on this before we discuss our results 
in detail. For the interested reader we note that a standard reference on Gevrey spaces 
and its applications is Rodino’s book m 

The classical Gevrey class G CT (R d ), cr > 1, consists of all / G C°°(R d ) with the 
following property: 

For every compact subset K C there are constants C, R > 0 such that 
for all x G K and all multi-indices a = (ay,..., a^) G INq the inequality 

\D a f(x)\ < CR ai+ - +ad {a l \---a d \) s 


holds. 

If / is 27r-periodic in each coordinate, i.e. if / G C'°°(T d ), the growth conditions on the 
derivatives can be rephrased in terms of Fourier coefficients: / belongs to G cr (R d ) if and 
only if there exists a constant /3 > 0 such that 

exp(2/3 \\k\\l /a )\c k (f)\ 2 < oo . 

Here one can replace ||/c||i by any other (quasi-)norm on H d . This gives only a different 
constant /3, but the exponent 1/cr does not change. This was the motivation in [2] 
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to introduce the periodic Gevrey spaces G a,/3,p (T d ), 0 < a < 1, 0 < /3,p < oo, which 
consist of all / € C°°(T d ) such that the norm 

||/|G ct ’ ftp (T ! )|| := ( £ exp(2/?||*||”)M/)| 2 ) 1/2 
kez d 

is finite. For convenience of notation we changed the exponent, setting a := 1/cr. Clearly, 
all these spaces are Hilbert spaces. 

In the definition of G a ’ p,p (T d ) one can extend the range of parameters to a > 0. The 
decisive difference is that the periodic Gevrey spaces, i.e. those with 0 < a < 1, contain 
non-analytic functions, while for a > 1 all functions in G a ’ p ’ p (T d ) are analytic. 

We come to the first result of this section. As an immediate consequence of Propo¬ 
sitions eh m and Theorem 14.31 we obtain 

Theorem 6.1. Let a,/3 > 0 and 0 < p < oo. Consider the approximation numbers 

On ■= a n {Id : G“^( T d ) L 2 { T d )). 

(i) For 1 < n < d, we have a n x a ,p,p 1- 

(ii) For d < n < 2 d , we have 

log(n) a ^ p 

log(l + d/log(n))_ 


hi(cin) ^ a ,p ft 


(in) For n > 2 d , we have 

— ln(a n ) x aiP f3d a/p n a/d . 

The limit result in Theorem 14.41 can be specialized as follows. Unfortunately, our 
proof technique does not work for classes of analytic functions. 

Theorem 6.2. Let 0 < p < 00 , 0 < a < min{l,p} 3 and f3 > 0. For 

On ■= (Id : G a ^ p (T d ) L 2 { T d )), 

we have 

lim a n ■ exp(/3vol (B d )~ a/d n a/d ) = 1. 

Proof. Let p : = min{l,p}. Further, let h 2 (x) = ((x + 1)^ + d p / p /2^) 1 ^ and h 3 (x) = 
(x p — d p / p /2 l ~ p ) l ^ p — 1. If we put x n = u 1//d vol(Hp) _1 / d , then the two-sided estimate 
(fnil in combination with Lemma 13.31 can be reformulated as 

expOd« - h 2 (x n ) a )) < a n exp((3vol(B d )~ a/d n a/d ) < exp(/3« - h 3 (x n ) a )). 

Copying the arguments given below of (1231) we conclude that lim n ^ 00 (x“ — h 2 (x n ) a ) = 0 
if a < p. Using similar arguments, we also get that lim n _ l . 00 (x“ — h 3 (x n ) a ) = 0 if 
a < p. □ 
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6.1 A connection with spaces of dominating mixed smoothness 

In the special situation a = p, the estimate in Theorem 16.11 (ii) can be written more 
transparently. Namely, for d < n < 2 d , we have constants C\(p) and c 2 (p) such that 

c\(jp)P ,ii i c 20 c 2 (p)P 

n~ i°g(i+d/io g (n)) < a n (Id : G p,p,p ( T d ) — > L 2 ( T“)) < n~ lo ^ 1+d / lo s(")) < n _1 °s( 1+d) . (25) 


We see that the dimension d affects the polynomial decay in n only logarithmically. In 
information-based complexity, such a decay behavior is called quasi-polynomial. This 
observation is highly remarkable for the following reason. The preasymptotic charac¬ 
teristics in (1251) closely resemble the preasymptotics observed in [16] for embeddings of 
Sobolev spaces with dominating mixed smoothness. Concretely, the recent paper [IB], 
involving two of the present authors, studies approximation numbers of the embedding 
Id : //* lix (T rf ) —y L 2 (T d ), where the Sobolev space with dominating mixed smoothness 
H,r d x(T d ) is equipped with one of the—in the classical sense equivalent—norms 


1/2 




E M/)i 


+ I 

o= 1 


\p\2s/p 


V e {1,2}. 


Now, in the preasymptotic range 1 < n < 4 d , the authors of [T6] observe 


2~ s — 
2 n 


1 \ c 1 (p)+log(l/2+d/log(n)) 


C2 (p)s 
2 \ 2 + 1o S2 d 


< a..(M : ffX(n L 2 (T d )) < ( - 


(26) 


where ci(l) = 0, ci(2) = 2, and c 2 (p) = 1/p for p G {1, 2}, see [fH Thm. 4.9, 4.10, 4.17]. 

The close resemblance of (125j) and (1261) is rather counterintuitive. After all, the space 
G p,s ’ p (T d ) contains much smoother functions than the Sobolev space with dominating 
mixed regularity H^f x {T d ), which is clearly visible in the asymptotic decay, see Remark 
16.51 But apparently, the stronger notion of smoothness does not pay off in the preasymp¬ 
totic range. Let us try to gain a deeper understanding of this unexpected relationship 
between spaces of Gevrey type and Sobolev spaces of dominating mixed smoothness. 
From the simple estimate n^=i(l + \kj\ p ) s ^ p < ex P( s /p||^llp) we conclude that we have 
the norm-one embedding 

G p ’S/ p ’ p (T d ) c —} H s J x (T d ). (27) 

Hence, the lower bound in (1251) . with /3 = s/p , yields a lower bound for the approximation 
numbers a n (Id : H/^ x (T d ) —> L 2 (T d )), which is only slightly worse than (126]) with 
regard to the polynomial decay in n. Note that (1261) has been obtained by doing the 
combinatorics explicitly for this special situation, whereas (125]) followed immediately 
from the characterization provided by Theorem 14.31 and the known behavior of the 
entropy numbers £ n (id : £ d — > £^), see Proposition 12.21 

In view of the norm-one embedding (127]) . the surprising part in fact is that the upper 
bound in (1261) is not substantially worse than (1251) . For the simplest case p — 1 and 
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s — 1, there is a good explanation in terms of grid numbers. The interested reader will 
find it easy to generalize this to s > 1. Consider the grid numbers of the f'f-ball 

In (r)B d = {x G R d : exp(||x||i) < r} 


and the hyperbolic cross 


d 

U d := {x G R d : J](l + |x,,j) < r}. 
j =i 

The hrst determine the behavior of the approximation numbers a n (Id : G 1,1,1 (T d ) —» 
L 2 (T d )), since G(ln(r)i?f) = j \{k G < r} (recall the considerations made 

in Section [4]). The latter determine the approximation numbers a n (Id : H^f x (T d ) —$■ 
L 2 (T d )). We will show now that these grid numbers behave sufficiently similar for 1 < 
r < 2 d . Essential ingredient of the proof is the observation that, for l < log 2 (r), the pro¬ 
jections Pi In [r)B d and PiTL d have similar volumes, where Pi : R d — > R/, (xi,..., x^) t->- 
(xi, ... ,Xj). 

Lemma 6.3. Let 1 < r < 2 d . Then, we have 

G(\n(r)B d ) < G(H d ) < rG{cln(r)B d ), 


where c = 1 + l/ln(2). 

Proof. The left-hand side follows trivially from n^=i(l + | x j |) < exp(||x||i). For the 
right-hand side estimate, we hrst note that 


G(n d ) 



A (r,i), 


where A(r,l ) := ft {k G IN Z : J][j =1 (l + fcj) — r }> see [IS Lem. 3.1]. Further, for 2 l < r, we 
have A(r,l ) < vi(r ), where vi(r ) = vol('H( fl {x G R 1 : Xj > 1}), and vi(r) < , 

see m Lem. 3.2], Consider now 

G(r,l ) := {k e IN* : exp(||fc||i) < r}, 

Wi(r ) := vol(ln(r)R[ fl {x G R ; : Xj > 0}) = — . 


For k G if let Q\ = k + [0, l]h From the two-sided set inclusion 

(x G R 1 : Xj > 1, exp(||x||i) < r} C Q[ C (x G R 1 : Xj > 0, exp(||x||i) < r} 

k£G(r,l) 

we obtain, by taking volumes and a change of variables, the two-sided estimate 

w t (r/e l ) < ft G(r,l) < wi(r). 
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G<x,P,P(rfd\ 


Id 


H s ’ p (T d ) 




l(Z d 


D' 


B w=,p 


i(Z a 


Figure 2: Commutative diagram for the embedding Id : G a,l3,p (T d ) —» LP ,p (T d 


Now, using l < ln(r)/ln(2), we observe 

A(r,l ) < u;(r) = rw;_ 1 (r) < ruy(r) < rjjG(eV,/) < rjjG(r c ,/). 

It remains to note that G(cln(r)£>f) = 1 + 2 z (f)i}G(r c , l), which can be seen 

easily by adopting the proof of [TB1 Lem. 3.1]. □ 

Remark 6.4. The direct preasymptotic calculations made in [16] Thm. 4.9] for the 
Sobolev space of dominating mixed smoothness can be adopted for the space of Gevrey 
type G 1,s,1 (T d ) using the elements introduced in the proof of Lemma [6.31 This yields, 
for 1 < n < 2 d , the estimate from above 

a n (Id : G 1,s,1 (T d ) L 2 ( T d )) < 


Compare with (125]1 . 

Remark 6.5. In contrast to the preasymptotic range, the approximation numbers 
a n (Id : G p ’ s / p,p (T d ) —* L 2 (T d )) and a n (Id : F* lix (T d ) —> L 2 (T d )) behave asymptotically 
completely different. On one side, we have the well-known result 

On (Id : RC X ( T d ) L 2 (T 1 )) x d n" a (ln n)^ s 

for the Sobolev space with dominating mixed smoothness, see |l6j and the references 
therein. On the other side, we learn from Theorem 16.11 (iii) that 

q - sdnP /d < a ^ w . G p ’P’ p (J d ) _► L 2 ( T d )) < q 2 sdnP/d , 

where qi = exp(c 1 /p),q 2 = exp(c 2 /p). 

6.2 Preasymptotics for embeddings into H s 

In this section, we consider approximation numbers of the embedding 

Id : G a ’P’ p (T d ) -> H s ’ p (T d ), 
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assuming s < (3a. Here, as before, G a ’^ ,p (T d ) is the periodic space of Gevrey type 
defined by the weight sequence w^ , see (1241) . and H s,p (T d ) is the isotropic periodic 
Sobolev space dehned by the weight sequence w S)P , see (J4J). 

For w an arbitrary weight sequence, recall the operators A w , Z) w , and F defined in 
(USD , m an d (DU) i n Section [H Further, let 

5 w . 4(Z d) ^ ^w (Td)j (2 7r) d F Zk/™(k)e ikx . 

k£7, d 

We can write the embedding Id : G a, ^ ,p (T d ) — > H s,p (T d ) as Id = B Ws ’ p o _D W o 
where w = w^^ p /w SiP , see Figure [5] for an illustration. At the same time, we also have 
Id = F o D w o A w . Hence, recalling the considerations made at the beginning of Section 
[3 it is clear that 

a n (Id : G a ’P ,p (T d ) -> H s ’ p (T 1 )) = a n (Id : H*( T d ) -> L 2 (T d )). 

Note that w(k) = £>(||fc|| p ), where 


ip(t) = exp((3t a )/t s . (28) 

Since we have assumed s < (3a, the function 3p is monotonically increasing for all t > 1. 
Consequently, we may apply Theorem 14.31 and obtain the following worst-case error 
estimates. 

Theorem 6.6. Let a,(3,s > 0, such that s < (3a, and 0 < p < oo. Consider the 
approximation numbers 


a n := a n (Id : G a ^ p (T l ) -)■ H s ’ p {T d )). 


(i) For 1 < n < d, we have a n x a ,p,s,p 1 - 

(ii) For d < n < 2 d , we have 


ln(a„) + s In 


log(n) 


log(l + d/ log(n)) 


^a,s,p 


■1 + (3 


log(n) 


log(l + d/ log(n)) 


a/p 


(in) For n > 2 d , we have 

— ln(a n ) + s ln(d) + s/d\n(n) x ajSjP —1 + (3d a ^ p n a ^ d . 


In case that a = p, (3 = r/p, which is particularly interesting in view of a comparison 
with spaces of dominating mixed smoothness and the discussion in Subsection 11.41 we 
can rewrite Theorem 16.61 (ii) as follows. 
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Corollary 6.7. Consider the approximation numbers 


:= a n (Id : G p ’ r/p ’ p (T d ) ->■ H s ’ p {T d )). 


For 1 < n < 2 d , we have 


ci(p, s ) 


log(n) 


log(l + d/ log(ra)) 


<a n < c 2 (p,s) 


s/p 


Cj(p)r 


77 , plog(l + d/log(n)) 


log(n) 


*/p 


C2(p) r 


77 plog(l+d/ log(n)) | 


Jog(l + d/ log(n))_ 

Proof. Let a = p, /3 = r/p. Then the asserted follows by Theorem 16.61 (ii) 
As a last point in this section we provide the following limit result. 
Theorem 6.8. Let 0 < p < oo, 0 < a < min{l,p}, and (3 > 0. For 

a n := a„(Id : G a ^ p (T d ) ->• H s ’ p (T d )), 

we have 


lim a„ ■ 


exp(/3 vol [B d )~ a / d n a / d ) vol (B d ) s / d 


n 


s/d 


= l. 


Proof. Let e n := £„(id : i d —> i^f). The general estimate (123|) now takes the form 


( 29 ) 


□ 


p{h 2 {l/{2£ n ))) 


^ tin ^ 


^l(V(2£ n )))’ 


where ip is defined in (l28]i and hi, h 2 are dehned in the proof of Theorem 14.41 This can 
be further estimated to 


1 


(2£ n ) s exp(/3h 2 (l/ (2 £„))“) 


A &n C 


(2 £ n ) s exp(/3hi(l/(2£ n ))“)' 


Writing x n = n 1//rf vo\{B d ) 1//d , it is easy to see that plugging in the estimates of Lemma 
13.31 leads to 


1 - 


2 1 ~ p d 1/p 


s/p 


X r 


exp(/3« - h 2 (x n ) a ) < a r 


exp(/3vol(-Bp) a / d n a / d )vo\(B d ) s / d 


n 


s/d 


and 


exp(/3 vo\(B d ) a / d n a l d )vo\(B d ) s l d 


rv 


s/d 


< exp(/3(x“ - h 3 (x n ) a )), 


where h 3 is dehned in the proof of Theorem 16.21 It remains to apply the arguments 
which we already used in the proof of Theorem 16.21 □ 
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7 Tractability analysis 

We conclude this paper with a tractability discussion. The tractability results follow 
more or less immediately from the worst-case error bounds which we have derived in the 
preceding sections. 

Theorem 7.1. Let s > 0 and 0 < p < oo. Then the approximation problem 

Id : H s ’ p (T d ) L 2 (T d ) 

(i) suffers from the curse of dimensionality iff p = oo (for all s > 0), 

(ii) does not suffer from the curse of dimensionality iff p < oo and s > 0, 

(in) is intractable iff p < oo and s < p, 

(iv) is weakly tractable iff p < oo and s > p. 

Theorem 7.2. Let a,/3> 0 and 0 < p < oo. Then the approximation problem 

Id : G a ’P’ p (T d ) -> L 2 (T d ) 

is quasi-polynomial tractable if and only if a > p. 

Before we turn to the proofs of Theorems 17.11 17.21 let us stress at this point that for 
the situations discussed here, the decay of approximation numbers in the preasymptotic 
range determines the tractability. This is a particularly interesting observation regarding 
the isotropic Sobolev space. As we have already pointed out in Section Q3 the asymptotic 
decay 

(Id : H s ’ p (T d ) -+ L 2 (T d )) x s , p , d n~ s ' d 

is often considered a typical indicator for the curse of dimensionality. However, as Theo¬ 
rem 17.11 shows, the approximation problem suffers only from the curse of dimensionality 
in the strict sense of information-based complexity when we equip the isotropic Sobolev 
space with the norm ||- | TP ,00 (T d )||. Otherwise, the approximation problem is weakly 
tractable, despite the bad asymptotic decay n~ s ^ d . For p — 1, 2, 2s this has already been 
observed in [15], see Remark 17.51 Concerning spaces of Gevrey type, it is no surprise 
in light of Section 16.11 that we obtain a similar tractability as has been observed for 
Sobolev spaces with dominating mixed regularity in |l6j . For some further remarks on 
the tractability of Gevrey embeddings, see Remark 17.71 

For the proof of Theorem 17.11 we have to translate the bounds of Theorem 11.11 into 
bounds for the information complexity. These bounds are given in Lemma 17.31 We omit 
the proof, which is technical and lengthy but requires only standard arguments. 

Lemma 7.3. For s > 0 and 0 < p < oo, consider the information complexity 

n{e, d) = min {n 6 IN : a„(Id : H s ’ p (T d ) ->• L 2 (T d )) < e}. 
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(%) From above, we have the bounds 


log n(e,d) < s , p < 


log (d) 

log (d) ( l/e) P//s 

log(l/e) (l/s) p/s 


log(l/e) (l/e) s (p+T') 



= 4(7) <e<4 

: e < £^( 7 ) 


where 7 > 0 and 



log(l + d/ log cZ) 1 



■= C d~ s/p 



The constant C s>p is the same as in the upper bound of Theorem 17.71 
(ii) From below, we have the bound 

logn(e, d) > SjP ( l/e) p,s for £%<£< e\, 


where 



The constant c StP is identical to the one in the lower bound of Theorem 17.71 
Proof of Theorem 7.7 

(i) For n < 2 d , Theorem 15.11 states that a n = 1. Hence, we have n(e,d) > 2 d for all 
e < 1 and the problem suffers from the curse of dimensionality. 

(ii) We show that for there is an £ > 0 such that n(e, d) is polynomial in d. Fix some 
£ > £ 2 - By Lemma 17711 (i), there is C s _ p > 0 such that 



the problem cannot suffer from the curse of dimensionality. 

(iii) To prove intractability is suffices that there is a sequence (£;, df ) ie ] N such that the 
limit in (TTX]) does not exist. Let (dj) ig ]N with di G IN and d* —* 00 for i —> 00 . Let 


£2 c StP (l/di) s/p . Then 


c 



and thus 


log n(£ hi d) ^ (1 /£i) p/s ^ (1 /£i) p/s 

dr + 1/Si - Cs «d+ 1 /£i ~ Cs ’ P ^J {1 /£i)P /s + l/ £ f 
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Finally, since p/s > 1, we have 1/e* < (l/£j) P//s and thus 


log n(£i,di) c 


> 


-s,p 


dk + 1 /£i + 1 

In consequence, the problem is not weakly tractable and must be intractable. 


> 0 for all i 6 IN. 


(iv) We have to show that the information complexity grows slower than both 2 1 / 6 and 
2 d . Put x = 1/e + d. Since both 1/e < x and d < x, we have for all e and all d 
that 

log n(£,d) < C SiP log(x)x p/s . 

Hence, lim^oo logn(e, d)/x — 0 as p < s. 


□ 


The tractability analysis for the approximation problem Id : G a, ^’ p (T d ) —» L 2 (T d ) 
can be reduced to the tractability analysis for the problem Id : H Wp (T d ) —> L 2 {T d ). 
Basis is the following general observation. 

Lemma 7.4. Let w be an arbitrary weight sequence and let 

n w (e,d) := min {n G IN : a n (Id : H w (T d ) -> L 2 ( T d )) < e}. 

For : [0, oo) —> [0, oo) monotonically increasing, consider the weight sequence cp( w) 
groen by <^(w)(0) := 1 and <p(w)(k) := ip(w(k)) for k G Z d \ {0}. We hare 

n v(w) (e,d) = n w (lMl/e),d). 

Proof. Let (cr n )neiN and (7n)new be the non-increasing rearrangements of 
1/w and l/(p(w), respectively. Then, using (j22]h we obtain 

n <r( w )( £ , rf) = min{n G IN : < £} = min{n G IN : l/(p(l/a n ) < £} 

= n w (l/p~\l/£),d). 


□ 


Proof of Theorem \7.2\ With ip(t) = exp (f3t a ) and 7 — p, Lemma 17.41 in combination 
with Lemma 17.31 (i) yields 


In n(e,d) < a ,/3, P 


ln(d) 

: e 1 <£ <1 

ln(rf) ln(l/e) p / Q 

: elf < £ < £' 

ln(ln(l/e)) ln(l/£) p /" 

: £t < £ < £; 

ln(ln(l/e)) ln(l/£) p /( 2 ") 

: £ < ct 


where if = l/</?(l/e[ 7 ). Since in the third case we may estimate ln(ln(l/e)) f/ a ,g,p ln(d) 
due to iff < £ and in the forth case we may estimate ln(ln(l/e)) < a ,p, P ln(l/e) P// ( 2 "\ we 
obtain 


In n{e, d) < a ,p, P In(d) ln(l/£) p/a 
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for all 0 < e < 1 and d £ IN, which leads to quasi-polynomial tractability if a > p. That 
a > p is also a necessary condition for quasi-polynomial tractability follows immediately 
by Lemma 17.41 and Lemma 17.31 (ii). □ 

Remark 7.5. The tractability of approximating the identity Id : H s,p (T d ) —>- L 2 (T d ) 
by finite-rank operators has already been studied in [T5] for p — 1, p — 2, and p = 2s. In 
the case p = 2, however, the authors could not show whether the problem is intractable 
or weakly tractable when 1 < s < 2, see [13 Thm. 5.5, Cor. 5.7]. The results from 
Section [4] allow to close this gap and furthermore to reproduce all tractability results 
obtained in |15| . For a different proof that allows to close the gap, we refer to the recent 
paper [54], 

Remark 7.6. Concerning the standard notions of tractability, asking for compressibility 
of frequency vectors (0 < p < 1) only has the effect that we need less smoothness to 
obtain weak tractability, see Theorem 17.11 (iv). To get a comprehensive understanding 
of the effect of compressibility, we need two additional notions of tractability introduced 
only recently. For a, (3 > 0 a problem is called (a, f3)-weakly tractable [24] if 

logn(e,d) 

Inn — - -r = 0 . 

1/e+d-^t-oo 1 /S a + 

A problem is called uniformly weakly tractable [23] if it is (a,/3)-weakly tractable for 
all a,/3 > 0. From Lemma 17.31 we can conclude that the approximation problem 
Id : H s ’ p (T d ) —>• L 2 (T d ) is (a, (3)-weakly tractable for a > p/s and all j3 > 0 (which 
has also been observed in [241 Thm 4.1]). Hence, if we impose a very strong compress¬ 
ibility constraint—which means that p gets small—then we have almost uniform weak 
tractability. 

Remark 7.7. The recent paper [3] studies the tractability of approximating embeddings 
Id : H w (T d ) —>■ L 2 (T d ) by operators of finite-rank for weight sequences w of the form 
w{k ) = u^ =1 aj \ kj \ 3 , k G Z d , where uj > 1, 0 < a\ < a 2 < a 3 < ... and inf bj > 0. Let 

n w (e, d) := min{n e IN : a n (Id : H w (T d ) L 2 (T d )) < e} 


be the information complexity of the approximation problem. In [3] it is studied under 
which conditions a modified, stronger notion of weak tractability is satisfied, namely 


, In n w (£,d) 

hm -—-————- 

in(i/e)+d —¥oo ln(l/e) T d 


= 0 . 


(30) 


The Gevrey weights w^ , dehned in (1241) . fit into the setting of [3j if a = p (by 
choosing ay = a 2 = • • • = (3 and b\ = b 2 = ■ ■ ■ = p). From [3, Thm. 1] it is immediately 
clear that the approximation problem Id : G a,l3 ’ p (T d ) —>■ L 2 (T d ) is not weakly tractable 
in the above sense if a — p. What more can be said? From Lemma 17.41 we get 

ln /r w “’ p (e, d) In d) 

lim - ; -= lim --—. 

l/e+d. d+l/e 1/e+d, d + \\\(l / £) 
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Hence the approximation problem Id : G a,/3,p (T d ) — > L 2 (T d ) is weakly tractable in the 
modified sense (l30l) if and only if the approximation problem Id : H a,p (T d ) —>■ L 2 (T d ) 
is weakly tractable in the classical sense, which is the case if and only if a > p. We 
conclude that weak tractability in the modified sense (13UD is almost equivalent to quasi¬ 
polynomial tractability for the approximation problem Id : G a, ^ ,p (T d ) —> L 2 (T d ), cf. 
Theorem 17.21 

As a final remark let us point out that other than claimed in |3j the space H w (T d ) 
consists of analytic functions if and only if inf bj > 1. The proof provided in [3] Section 
10] is wrong, and even under the additional assumption inf bj > 1 incomplete as it only 
shows convergence of the Taylor expansion. For a correct proof, see m- 
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